Extracellular mucin detected by deep learning from histopathology images predicts consensus molecular subtypes in colorectal cancer

Oxana (Sachenkova) Lundström1 (oxana.lundstrom@dbb.su.se, @merenlin), H-G Nguyen2, Maria Anisimova3, Inti Zlobec2
  1. Stockholm University
  2. Institute of Pathology, University of Bern
  3. Institute of Applied Simulations, ZHAW Wädenswil

Mucin in colon and rectal cancer leads to poor outcome for certain types of tumors. It can also help us predict tumor type and classify patients to improve treatment.

Abstract

Colorectal cancers are a heterogeneous group of tumors, from the histomorphological, clinical and molecular points-of-view. In terms of molecular changes, one particularly well-described genomic aberration is microsatellite instability (MSI), affecting approximately 15% of all cancers and is an important factor affecting treatment decisions in colorectal cancer. Attempts have been made to predict MSI cancers using only the histopathology image (Bychkovetal 2018). In fact, experienced pathologists can often identify MSI cases simply be glancing at the stained slide. Mucin is not exclusively found in MSI-high patients, and the prognosis associated with mucinous histology is controversial.

Methods

In order to better understand the role of extracellular mucin in colorectal cancer, we created a deep learning classifier to quantify the extracellular mucin-to-tumor area ratio in three independent cohorts (Bern n=517, TCGA n=412 & CPTAC n=85) and investigate the genotype-phenotype correlation of mucin with CMS groups, MSI status and gene expression of mucin-producing genes.

Bern cohort

A large retrospective cohort of 517 primary colorectal cancer patients diagnosed at the Institute of Pathology between 2002-2018 and treated at the Insel Hospital Bern (Switzerland)

TCGA

The Cancer Genome Atlas (TCGA) data (Cancer Genome Atlas 2012) from public repositories at the National Institutes of Health (NIH; USA). 412 patients cases have been selected.

CPTAC

A public dataset The Clinical Proteomic Tumor Analysis Consortium (CPTAC) from the National Cancer Institute (NCI; USA) was also included in this study. The 373 H&E WSIs of 106 colon adenocarcinoma patients were downloaded from The Cancer Imaging Archive (TCIA) (National Cancer Institute Clinical Proteomic Tumor Analysis Consortium 2020).

Weakly supervised tissues segmentation from prior information

A novel precise tissue segmentation of histopathology images using deep learning, namely Group Affinity Weakly Supervised segmentation (GAWS) is proposed. It processes one histopathology image and some patches of prior tissue as input with three main steps.

Evaluation of the method

First, we tested the inter-observer agreement of extracellular mucin component, recorded as the percentage of total tumor area covered by extracellular mucin, between two pathologists. The results were excellent [pathologist 1 ICC=0.915 (95%CI: 0.885-0.937) and pathologist 2 ICC=0.923 (95%CI: 0.896-0.943)].

Results

Mucinous histology and survival outcome

In CMS1 patients, mucinous histology is clearly a favorable factor, whereas in stage II and CMS3 patients, patients experience a much more unfavorable outcome. This may help to add to the current discrepancies found in the literature with regard to the impact of this histological feature and outcome (REF).

Mucin is independent of MSI

Mucinous histology is independent of microsatellite instability(MSI) status. On the one hand, mucin is often a feature of MSI, and a likely consequence of genomic instability affecting mucin-producing genes. On the other, it occurs in non-MSI cases as well. Our work here based on only 2 features, namely MSI status and the presence of extracellular mucin, helps to narrow down identification of CMS groups. Nearly all “mucinous” tumors with MSS are CMS3 or CMS4.

Association of expression of mucin-associated genes and CMS groups

In this study, we show that mucinous histology and moreover, the presence of extracellular mucin in colorectal cancer, can help predict the underlying Consensus Molecular Subgroup(CMS) group, independently of the MSI status. CMS2 cancers rarely show any mucin, a result validated by gene expression analysis of four mucin producing genes - MUC2, MUC4, MUC5AC and MUC5B.

Conclusion

Extracellular mucin is an indicator of a phenotype-genotype correlation and together with MSI status can differentiate between CMS1 and CMS3 cancers. Secondly, extracellular mucin is a feature of poor outcome in patients with stage II cancers and CMS3 tumors, despite the more indolent features associated with a mucin-producing phenotype and should be considered during clinical management.

References

Bychkovetal, D. 2018. “Deep Learning Based Tissue Analysis Predicts Outcome in Colorectal Cancer.” Scientific Reports 8: no. 1. https://doi.org/10.1038/s41598-018-21758-3.
Cancer Genome Atlas, N. 2012. “Comprehensive Molecular Characterization of Human Colon and Rectal Cancer.” Nature, no. 487: 330–37. https://doi.org/10.1038/nature11252.
National Cancer Institute Clinical Proteomic Tumor Analysis Consortium, (CPTAC). 2020. “Data from the Clinical Proteomic Tumor Analysis Consortium Colon Adenocarcinoma Collection.” The Cancer Imaging Archive. https://doi.org/10.1038/nature11252.